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Abstract 

As originally proposed, type classes provide overloading and ad- 
hoc definition, but can still be understood (and implemented) in 
terms of strictly parametric calculi. This is not true of subsequent 
extensions of type classes. Functional dependencies and equality 
constraints allow the satisfiability of predicates to refine typing; this 
means that the interpretations of equivalent qualified types may not 
be interconvertible. Overlapping instances and instance chains al¬ 
low predicates to be satisfied without determining the implemen¬ 
tations of their associated class methods, introducing truly non- 
parametric behavior. We propose a new approach to the semantics 
of type classes, interpreting polymorphic expressions by the be¬ 
havior of each of their ground instances, but without requiring that 
those behaviors be parametrically determined. We argue that this 
approach both matches the intuitive meanings of qualified types 
and accurately models the behavior of programs. 

Categories and Subject Descriptors D.3.1 [Programming Lan¬ 
guages ]: Formal Definitions and Theory—Semantics; F.3.2 [Log¬ 
ics and Meanings of Programs]: Semantics of Programming Lan¬ 
guages—Denotational semantics 
Keywords overloading; type classes; semantics 

1. Introduction 

Implicit polymorphism (as provided by the Hindley-Milner type 
systems in ML and Haskell) provides a balance between the safety 
guarantees provided by strong typing, and the convenience of 
generic programming. The Hindley-Milner type system is strong 
enough to guarantee that the evaluation of well-typed terms will 
not get stuck, while polymorphism and principal types allow pro¬ 
grammers to reuse code and omit excessive type annotation. Type 
classes [16] play a similar role for overloading: they preserve strong 
typing (ruling out run-time failures from the use of overloaded 
symbols in undefined ways) without requiring that programmers 
explicitly disambiguate overloaded expressions. Since their intro¬ 
duction, type classes have seen numerous extensions, such as multi¬ 
parameter type classes, functional dependencies [5], and overlap¬ 
ping instances [13]; a variety of practical uses, from simple over¬ 
loading to capturing complex invariants and type-directed behav¬ 
ior; and, the adoption of similar approaches in other strongly-typed 
programming languages, including Isabelle and Coq. 
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1.1 Dictionary-Passing and its Disadvantages 

The semantics of type classes has primarily been given by trans¬ 
lations from instance declarations and (implicit) overloading to 
dictionaries and (explicit) dictionary arguments. This parallels the 
treatment of implicit polymorphism by translation to systems with 
explicit polymorphism (such as System F), and shares similar chal¬ 
lenges. For a simple example, in Haskell, the map function has the 
polymorphic type scheme (/ -A u) -A [/] -A [u] . In translating to 
System F, this could be interpreted as either 
Vf.Vw.(t —> u) [f] -A [u] or Vw.Vt.(f -A m) -A [t] -A [u\. 

But these types are not equivalent: they express different orders 
of passing type arguments. There are various ways of addressing 
this discrepancy: for example, Mitchell [8] shows that, for any two 
translations of an implicitly typed scheme, there is a term (which he 
calls a retyping function) which transforms terms of one translation 
to terms of the other, while only manipulating type abstractions and 
applications. Similar issues arise in the semantics of type classes. 
For example, a function to compare pairs (/, u) for equality could 
be given either the type scheme 

(Eq t, Eq u) =* (t, u) -a (t, u) -A Bool 
or the type scheme 

(Eq u. Eq t) =>■ (t, u) -A (t, u) -A Bool. 

In a dictionary-passing translation, type classes are interpreted by 
tuples, called dictionaries, containing the type-specific implemen¬ 
tations of each of the class methods. Class instances correspond to 
dictionary definitions, while predicates in types correspond to dic¬ 
tionary arguments. In the case of the Eq class, which has equality 
and inequality methods, we could define Eq dictionaries by 
EqDict t spit A(A Bool, t -¥ t - A Bool). 

Even though the two types for pair equality above are equivalent in 
the implicitly overloaded setting, their dictionary-passing transla¬ 
tions are not: the first corresponds to a function of type 

EqDict t -A EqDict u -A (/, u) -A (t, u) -A Bool, 
while the second corresponds to 

EqDict u -A EqDict t -A (/, u) -A (f, u) -A Bool, 

Again, approaches exist to address this discrepancy: for exam¬ 
ple, lones shows [3] that there are conversion functions, similar 
to Mitchell’s retyping functions, to convert between different trans¬ 
lations of the same overloaded term. 

Our own work began by exploring instance chains [10], a pro¬ 
posed extension to Haskell-like type class systems. In the course 
of this exploration, we discovered several difficulties with existing 
approaches to the semantics of overloading. 

Mismatch in expressivity. System F typing is significantly more 
expressive than the Hindley-Milner type systems it is used to 
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model. In particular, even within the translation of an ML or 
Haskell type scheme, there are arbitrarily many expressions that 
do not correspond to any expressions of the source language. 
The problem is compounded when considering dictionary-passing 
translations of type classes. For example, there is no notion in 
Haskell of class instances depending on terms; on the other hand, 
there is no difficulty in defining a term of type Int —> EqDict Int. 
Uses of such a term cannot be equivalent to any use of the methods 
of Eq. As a consequence, there are properties of source programs 
(for example, that any two instances of ssei at the same type are 
equal) that may not be provable of their dictionary-passing transla¬ 
tion without reference to the specific mechanisms of translation. 

Predicates refine typing. Second, the notions of equivalence of 
System F and Haskell types diverge once the satisfiability of pred¬ 
icates can refine typing. For example, functional dependencies al¬ 
low programmers to declare that some parameters of a class depend 
upon others; in the declaration 

class Elems c e | c 4 e where 
empty :: c 

insert :: e —» c —» c 

the dependency c —1 e captures the intuition that the type of a 
container’s elements are determined by the type of the container. 
Concretely, given two predicates Elems tv and Elems ¥ v', if we 
know that r = r', then we can conclude v = v'. This property 
is lost in the dictionary-passing translation. Dictionaries for Elems 
contain just their methods: 

ElemsDictce = (c,e —¥ c c) 

As a consequence, there are types that are equivalent in Haskell, 
but are not interconvertible in the dictionary-passing interpretation. 
For example, the type (Elems ce, Elems ce') => e —> e’ —¥ c is 
equivalent to the (simpler) type (Elems ce) => e — le—t c as 
we must have that e = e' for the qualifiers in the first type to be 
satisfiable. However, there is no corresponding bijection between 
terms of type ElemsDictce ElemsDictce' —te—¥e'—lc 
and terms of type ElemsDictce e e c. While we can 
construct a term of the second type given a term of the first, there 
is no parametric construction of a term of the first type from a term 
of the second. 

Non-parametric behavior. Finally, other extensions to class sys¬ 
tems make it possible to define terms which have no translation 
to parametric calculi. For example, we could define a function 
invBool that negated booleans and was the identity on all other 
types. We begin by introducing a suitable class: 

class Univ t where 
invBool :: t -—>■ t 

There are several approaches to populating the class, using differ¬ 
ent extensions of the Haskell class system. Using overlapping in¬ 
stances [13], we could simply provide the two desired instances of 
the class, relying on the type checker to disambiguate them based 
on their specificity: 

instance Univ Bool where 
invBool = not 
instance Univ t where 
invBool = id 

Using instance chains, we would specify the ordering directly: 

instance Univ Bool where 
invBool = not 
else Univ t where 
invBool = id 


With either of these approaches, we might expect that the type of 
the class method invBool is (Univ t) => t t. However, the 
predicate Univ r is provable for arbitrary types r. Thus, the above 
type is intuitively equivalent to the unqualified type t —¥ t\ however, 
there is no term of that type in a purely parametric calculus that has 
the behavior of method invBool. (In practice, this is avoided by 
requiring that invBool’s type still include the Univ predicate, even 
though it is satisfied in all possible instantiations; while this avoids 
the difficulties in representing invBool in a parametric calculus, it 
disconnects the meaning of qualified types from the satisfiability of 
their predicates.) 

1.2 Specialization-based Semantics 

We propose an alternative approach to the semantics of type-class 
based implicit overloading. Rather than interpret polymorphic ex¬ 
pressions by terms in a calculus with higher-order polymorphism, 
we will interpret them as type-indexed collections of (the inter¬ 
pretations of) monomorphic terms, one for each possible ground 
instantiation of their type. We call this a specialization-based ap¬ 
proach, as it relates polymorphic terms to each of their (ground- 
typed) specializations. We believe this approach has a number of 
advantages. 

• First, our approach interprets predicates directly as restrictions 
of the instantiation of type variables, rather than through an in¬ 
termediate translation. Consequently, properties of the source 
language type system—such as the type refinement induced by 
the Elems predicates—are immediately reflected in the seman¬ 
tics, without requiring the introduction of coercions. 

• Second, our approach naturally supports non-parametric exam¬ 
ples, such as class Univ, and avoids introducing artificial dis¬ 
tinction between the semantics of expressions using parametric 
and ad-hoc polymorphism. 

• Third, because our approach does not need to encode overload¬ 
ing via dictionaries, it becomes possible to reason about class 
methods directly, rather than through reasoning about the col¬ 
lection of dictionaries defined in a program. 

Our approach builds on Ohori’s simple semantics for ML polymor¬ 
phism [12], extended by Harrison to support polymorphic recur¬ 
sion [1], 

In this paper, we introduce a simple overloaded language called 
H _ (§2), and give typing and equality judgments in the presence of 
classes and class methods. We apply our specialization-based ap¬ 
proach to give a denotational semantics of H _ (§3), and show the 
soundness of typing and equality with respect to the denotational 
semantics (§4). We also develop two examples, to demonstrate the 
advantages of our approach. First, we consider a pair of definitions, 
one parametric and the other ad-hoc, defining operational equiva¬ 
lent terms. We show that the defined terms are related by our equal¬ 
ity judgment (§2.3) and have the same denotations (§3.5). This 
demonstrates the flexibility of our approach, and the ability to rea¬ 
son about class methods directly (the second and third advantages 
listed above). Second, we extend H“ with functional dependen¬ 
cies (§5), and establish the soundness of the (extended) typing and 
equality judgments, all without having to augment the models of 
terms. This demonstrates the extensibility of our approach, and the 
close connection between properties of source terms and properties 
of their denotations (the first advantage listed above). 

2. The H“ Language 

Figure 1 gives the types and terms of H _ ; we write x to denote a 
(possibly empty) sequence of x’s, and if n is a predicate C r, we 
will sometimes write class (it) for C. As in Jones’s theory of qual¬ 
ified types [2], the typical Hindley-Milner types are extended with 



Term variable x £ Var 

Term constants k 

Type variables t £ War 

Type constants K 

Class names C 

nstance names d £ InstName 

Types t,v 

::= t \ K \ f —f t 

Predicates Fred B tt 

::= Ct 

Contexts P, Q 

::= 7T 

Qualified types p 

::= r | 7T => p 

Type schemes Scheme B a 

::= P 1 Vt.a 

Expressions Expr B M,N 

::= x\k \ Ax.M \ MN 


| px.M | let x = M in N 

Class axioms Axiom B a 

::= d : \/t. P => Tt 

Axiom sets A 

G Axiom 

Methods: 


Signatures Si 

£ Var —<■ Pred x Scheme 

Implementations Im 

£ InstName x Var — Expr 

Class contexts 'P 

::= ( A,Si,Im ) 


Figure 1: Types and terms of H . 


qualified types p , capturing the use of predicates. We must also ac¬ 
count for the definition of classes and their methods. One approach 
would be to expand the grammar of expressions to include class 
and instance declarations; such an approach is taken in Wadler and 
Blott’s original presentation [16]. However, this approach makes 
such definitions local, in contrast to the global nature of subsequent 
type class systems (such as that of Haskell), and introduces prob¬ 
lems with principal typing (as Wadler and Blott indicate in their 
discussion). We take an alternative approach, introducing new top 
level constructs (axioms A, method signatures Si, and method im¬ 
plementations Im) to model class and instance declarations. We re¬ 
fer to tuples of top level information as class contexts T, and will 
give versions of both our typing and semantic judgments parame¬ 
terized by such class contexts. Note that this leaves implicit many 
syntactic restrictions that would be present in a full language, such 
as the requirement that each instance declaration provide a com¬ 
plete set of method implementations. 

2.1 H Typing 

We begin with the typing of H“ expressions; our expression lan¬ 
guage differs from Jones’s only in the introduction of p (providing 
recursion). Typing judgments take the form 
P \ V a M : a. 

where P is a set of predicates restricting the type variables in T and 
a, and A is the set of class axioms (the latter is the only significant 
difference between our type system and Jones’s). The typing rules 
for H“ expressions are given in Figure 2. We write ftv(r) for the 
free type variables in r, and extend ftv to predicates tt , contexts P, 
and environments T in the expected fashion. Rules (=> I) and 
(=> E) describe the interaction between the predicate context P and 
qualified types p. Otherwise, the rules are minimally changed from 
the corresponding typing rules of most Hindley-Milner systems. 

We continue with the rules for predicate entailment in H“, 
given in Figure 3. The judgment d : P I Fa 7r denotes that the 
axiom named d proves predicate 7r, given assumptions P and class 
axioms A. We use a dummy instance name, written -, in the case 
that the goal is one of the assumptions. We will omit the instance 
name if (as in the typing rules) the particular instance used is 
irrelevant. We write P Fa Q if there are d\ ... d„ such that di : P II- 
Qi, and I Fa P to abbreviate 0 I Fa P. Our entailment relation differs 
from Jones’s entailment relation for type classes and from our 
prior systems [10] in two respects. First, our system is intentionally 
simplified (for example, we omit superclasses and instance chains). 


P\V\~ax: a 


P \ T,x : t \~ A M : t' 

(->/) -- 

P | r Fa (A x.M) : t ->• r' 


P \ V \- a M : t —¥ t' P\T\- a N:t 

(->£) —-- 

P | T F a (MW) : t 

P \ T,x \ t \~ A M \ t P,n \ T \~ A M : p 

(At)—- (=►/)--— 

P I r Fa a IX.M :r P | r Fa M : 7r => p 

PITFaMitt^p PIFatt 
P | T Fa M : p 

P I r Fa M : a t<£ftv(T,P) P \T \- A M -.Vt.a 

P | T Fa M : Vt.a P \ V Fa M : [r/t\o 

P I T Fa M : cr P \T ,x : a \~ A N : t 

(Let)- 

P | T Fa (let x = M inN) : r 


Figure 2: Expression typing rules of H . 


- : P IFa tt 

(d : Vt.Q' => rr') € A = 


Figure 3: Predicate entailment rules of H . 


Second, we do not attempt to capture all the information that would 
be necessary for an dictionary-passing translation; we will show 
that having just the first instance name is sufficient to determine the 
meanings of overloaded expressions. 

In the source code of a Haskell program, type class methods are 
specified in class and instance declarations, such as the following: 
class Eq t where (=) :: t —» t —» Bool 
instance Eq t => Eq [t] where xs = ys = ... 

We partition the information in the class and instance declarations 
into class context tuples (A, Si, Im). The logical content is captured 
by the axioms A; in this example, we would expect that there would 
be some instance name d such that 

(d : Vf. Eq t => Eq [t]) £ A. 

Haskell’s concrete syntax does not name instances; for our pur¬ 
poses, we assume that suitable identifiers are generated automati¬ 
cally. The method signatures are captured in the mapping Si\ we 
distinguish the class in which the method is defined (along with the 
corresponding type variables) from the remainder of the method’s 
type scheme. For this example, we would have 

M(=®|= (Eq t,t —¥ t —¥ Bool). 

Note that we have not quantified over the variables appearing in the 
class predicate, nor included the class predicate in the type scheme 
f —> t —» Bool. Each predicate in the range of Si will be of the form 
C t for some class C and type variables t, as they arise from class 
definitions. The type scheme of a class member may quantify over 
variables or include predicates beyond those used in the class itself. 
For example, the Monad class has the following definition: 
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(Ctxt) 


{tt * tt' I (d : P = 4 > tt), {d' : P' => tt') € A} 

{(P I r,x, : <y Xi \~A Im(y,d) : o y j) \ (y,d) G dom(/w)} 
P| r,xTTo^h A M:(7 
P I r b <AA/m> M : a 


Figure 4: H typing with class contexts. 


class Monad m where 
return :: a —> m a 
(»=) :: m a —> (a —> m b) 4 m b 

Note that the variable a in the type of return is not part of the 
Monad constraint. Thus, we would have that 

Si(return) = (Monad m,ia.a —> ma). 

The method implementations themselves are recorded in compo¬ 
nent Im, which maps pairs of method and instance names to imple¬ 
menting expressions. 

To describe the typing of methods and method implementations, 
we begin by describing the type of each method implementation. 
This is a combination of the defining instance, including its context, 
and the definition of the method itself. For example, in the instance 
above, the body of the =s= method should compare lists of arbitrary 
type t for equality (this arises from the instance predicate Eq [t] 
and the signature of ==), given the assumption Eq t (arising from 
the defining instance). That is, we would expect it to have the type 

Vf.Eq t => [f] ->■ [t] -¥ Bool. 

We introduce abbreviations for the type scheme of each method, 
in general and at each instance, assuming some class context 
( A,Si,Im ). For each method name x such that Si(x) = (tt, Vw.p), 
we define the type scheme for x by: 


or, equivalently, writing p as Q => t: 

a x =Vi,u. (tt, Q)=>t 

where, in each case, t = ftv(iv). Similarly, for each method x as 
above, and each instance d such that 

• (x, d) € dom(7/w); 

• (d:it. 7r') €E A; and, 

• there is some substitution S such that S 7r = ■n' 
we define the type scheme for x in d by: 

<r x ,d = Vt,u. (P, SQ) => Sr. 

Finally, we give a typing rule parameterized by class contexts 
in Figure 4; in Xj : <r xi , the x, range over all methods defined in the 
program (i.e., over the domain of Si). Intuitively, an expression M 
has type r under (A, Si, Im) if: 


{/3} 

M 

M 

{Let} 


P\T,x:t \- 9 M :t’ P\T\- 9 N:- i 

P | T h* (A x.M)N = [N/x]M : r' 

P I n;*:M : T — y T f x<£fv{M) 


P | T J-* A x.{Mx) = M : t t' 

P | T,x : rJhf.M : r 
P | r F* px.M - \px.M/x\M : r 
P | r M : a P\ T,x:(T\-*N:t 
P | rh* (let x = M in N) = [M/x\N : 
Si(x) = (7r, a) d-.PWS-K 


{Method} 


{VI} 


p I r \-(A,si,im) X = Im(x, d) :Sa 
‘ £MP, r) {(P ri-f MeJV: [r/t]a) I r € GType} 


{VE} 

{^P} 


P I r l-M> M = N : Vt.cr 
P|rh t MsW: 'it.a 
P I r h* M = N : [r/t\a 
P, 7T I r h* M = N : p 


p | r h® m = 
P I r I-* M = N : 


N : 


P\r\~ 9 M = N : p 
Figure 5: Equality for H“ terms. 


2.2 Equality of H Terms 

In this section, we give an axiomatic presentation of equality for 
H _ terms. Our primary concerns are the treatment of polymor¬ 
phism and class methods; otherwise, H“ differs little from standard 
functional calculi. As described in the introduction, our intention is 
to permit reasoning about class methods directly, without relying 
on either a dictionary-passing translation or a preliminary inlining 
step that resolves all method overloading. This results in two un¬ 
usual aspects of our rules: 

• While our presentation gives equality for expressions, it relies 
critically on components of the class context (A,Si,Im) —the 
axioms A to determine which instance solves given constraints, 
and the method implementations Im to determine the behavior 
of methods. 

• The treatment of polymorphism cannot be completely paramet¬ 
ric, and different equalities may be provable for the same term 
at different types; for example, we cannot hope to have uniform 
proofs of properties of the = method when it is defined differ¬ 
ently at different types. 


None of the class instances overlap. More expressive class sys¬ 
tems will require more elaborate restrictions; we give an ex¬ 
ample when extending H“ to support functional dependen¬ 
cies (§5). 

Each method implementation Im(x, d) has the type a x ,d (meth¬ 
ods are allowed to be mutually recursive). 

The main expression has the declared type a, given that each 
class method x, has type <r Xi . 


Equality judgments take the form P Th q, M = N : a, denot¬ 
ing that, assuming predicates P, variables typed as in T, and class 
context 'P, expressions M and N are equal at type a. To simplify 
the presentation, we have omitted equational assumptions; how¬ 
ever, extending our system with assumptions and a correspond¬ 
ing axiom rule would be trivial. The rules are those listed in Fig¬ 
ure 5, together with rules for reflexivity, symmetry, and transitivity 
of equality, and the expected a-equivalence and congruence rules 
for each syntactic form. Rules {/3}, {r;}, {p,} and {Let} should 
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be unsurprising. Rules {=>1} and {=>E} mirror the corresponding 
typing rules, assuring that we can only conclude equalities about 
well-typed expressions. Rule {VE} should also be unsurprising: if 
we have proved that two expressions are equal at a quantified type, 
we have that they are equal at any of its instances. Rule {VI} is 
less typical, as it requires one subproof for each possible ground 
type ( GType ranges over ground type expressions). Note that this 
is only non-trivial for terms involving overloading. Finally, rule 
{Method} provides (one step of) method resolution. Intuitively, 
it says that for some class method x at type a, if instance d proves 
that x is defined at a, then x is equal to the implementation of x 
provided by instance d. 

2.3 Polymorphic Identity Functions 

In the introduction, we gave an example of a polymorphic function 
(invBool) that could be instantiated at all types, yet did not have 
parametric behavior. In this section, we will consider a function 
which does have parametric behavior, but is defined in an ad-hoc 
fashion. We will demonstrate that our treatment of equality allows 
us to conclude that it is equal to its parametric equivalent. 

Our particular example is the identity function. First, we give its 
typical definition: 

idl :: t —> t 
idl x = x 

For our second approach, we intend an overloaded definition that is 
provably equal to the parametric definition. We could produce such 
a definition using instance chains: 

class Id2' t where 
id2' : : t -A t 

instance (Id2' t, Id2' u) => Id2' (t —> u) where 
id2' f = id2' of o id2' 
else Id2 1 t where 
id2' x = x 

This gives an ad-hoc definition of the identity function, defined 
at all types but defined differently for function and non-function 
types. Reasoning about this definition would require extending 
the entailment relation to instance chains, introducing significant 
additional complexity. We present simpler instances, but restrict the 
domain of types to achieve a similar result. 

class Id2 t where 
id2 :: t —F t 
instance Id2 Int where 
id2 x = x 

instance (Id2 t, Id2 u) =t- Id2 (t -¥ u) where 
id2 f = id2o f o id2 

We will use Int to stand in for all base (non-function) types. 

It should be intuitive that, while they are defined differently, 
idl x and id2 x should each evaluate to x for any integer or 
function on integers x. Correspondingly, given a class context T 
that describes (at least) Id2, we can prove that K|, idl = id2 : 
t (we omit the empty context and empty assumptions) for any 
such type r. The case for integers is direct: one application of 
{Method} is sufficient to prove b* id2 3 Xx.x : Int —t Int. 
For functions of (functions of...) integers, the proof has more 
steps, but is no more complicated. For the simplest example, to 
show that 

b* id2 = Xx.x : (Int —> Int) —> (Int —> Int), 
we use {Method} to show 

b* id2 = A/.(id2 of o id2) : (Int -a Int) -A (Int -A Int). 


Relying on the usual definition of composition and {/)}, we show 
b* A/.(id2 of o id2) 3 A/.Ax.id2(/(id2x)) : 

(Int 3- Int) —¥ (Int -3 Int) 

Finally, by two uses of {Method} for id2 on integers, and {r;}, 

b® A/.Ax.id2(/(id2x)) 3 A/./ : (Int 3- Int) —> (Int -A Int) 
and thus the desired result. 

We cannot expect to prove that idl 3 Id2 at all types (i.e., 
idl 3 id2 : Vt.f — ¥ t) without limiting the domain of types. 
For example, there is no instance of Id2 at type Bool; therefore, we 
cannot prove any non-trivial equalities K[, id2 3 M : Bool —> 
Bool. However, if we were to restrict the grammar of types to 
those types for which Id2 is defined (that is, if we define that 
t ::= Int r r), then we could construct such an argument. To 
show that Ki, id2 3 Xx.x : Vf.f —> t, we begin by applying {VE}, 
requiring a derivation Kj, id2 3 Xx.x : t -A t for each ground 
type t. We could construct such a set of derivations by induction 
on the structure of types, using the argument for Int above as the 
base case, and a construction following the one for Int —> Int for 
the inductive case. 

A similar approach applies to the formulation using instance 
chains (class Id2'): we could show that the first clause applied to 
functions, the second clause applied to any non-function type, and 
use induction over the structure of types with those cases. 

3. A Simple Semantics for Overloading 

Next, we develop a simple denotational semantics of H“ programs, 
extending an approach originally proposed by Ohori [12] to de¬ 
scribe the implicit polymorphism of ML. As with the presentation 
of equality in the previous section, the primary new challenges arise 
from the definition of class methods and the treatment of overload¬ 
ing. We will demonstrate that the specialization-based approach is 
well-suited to addressing both challenges. In particular, it allows 
expressions to have different interpretations at each ground type 
without introducing additional arguments or otherwise distinguish¬ 
ing qualified from unqualified type schemes. 

3.1 The Meaning of Qualified Types 

To describe the meaning of overloaded expressions, we must begin 
with the meaning of qualified types. Intuitively, qualifiers in types 
can be viewed as predicates in set comprehensions—that is, a class 
Eq denotes a set of types, and the qualified type Vr.Eq t =#• I —> 
t -A Bool describes the set of types {t —>- t —>• Bool | t € Eq}. 
However, most existing approaches to the semantics of overloading 
do not interpret qualifiers in this fashion: Wadler and Blott [16], 
for instance, translate qualifiers into dictionary arguments, while 
Jones [2] translates qualified types into a calculus with explicit 
evidence abstraction and application. 

Our approach, by contrast, preserves the intuitive notion of 
qualifiers. Given some class context = ( A,Si,Im ), we define 
the ground instances |_crj ,|, of an H“ type scheme a by: 

W* = M 

L J 10 otherwise 
LVi.aj*= [J [[r/t]a\*. 

Equivalently, if we define GSubst{t ) to be substitutions that map t 
to ground types and are otherwise the identity, we have 
LVf.P =>• rj * = {St | S € GSubst(i, lb A SP}. 



We will omit annotation 'll when it is unambiguous. 

In the typing judgments for H“, predicates can appear in both 
types and contexts. To account for both sources of predicates, we 
adopt Jones’s constrained type schemes (P | a), where Pisa list 
of predicates and a is an H - type scheme; an unconstrained type 
scheme a can be treated as the constrained scheme (0 | a) (as an 
empty set of predicates places no restrictions on the instantiation 
of the variables in a). We can define the ground instances of 
constrained type schemes by a straightforward extension of the 
definition for unconstrained schemes: if 'P = (A, Si, Im ), then 

L (P |Vt.e^r)J*={5r|5€ GSubst(t),\\- A ( P,SQ )}. 

3.2 Type Frames for Polymorphism 

We intend to give a semantics for H“ expressions by giving a map¬ 
ping from their typing derivations to type-indexed collections of 
monomorphic behavior. We begin by fixing a suitable domain for 
the monomorphic behaviors. Ohori assumed an underlying type- 
frame semantics; his translations, then, were from implicitly poly¬ 
morphic terms to the interpretations of terms in the simply-typed 
A-calculus. Unfortunately, we cannot apply his approach without 
some extension, as type classes are sufficient to encode polymor¬ 
phic recursion. However, we can adopt Harrison’s extension [1] of 
Ohori’s approach, originally proposed to capture polymorphic re¬ 
cursion, and thus also sufficient for type class methods. 

We begin by defining PCPO frames , an extension of the stan¬ 
dard notion of type frames. A PCPO frame is a tuple 

r = <r type h , r term h , r T ,„, gr, 

(where we will omit the type and term annotations when they are 
apparent from context) subject to the following six conditions. 

1. For each ground type t, 7^ ype [[r] is a non-empty set providing 
the interpretation of r. 

2. For each typing derivation A of T b M : r and T-compatible 
environment 77 , T term [A] 77 is the interpretation of M in T type [r]. 

3. 7V,„ : 7^ ype [r -7 u] x T^ pe |[r] -> T* ype H provides the 
interpretation of the application of an element of r —> v to an 
element of r. 

4. For any f,g € 7 -type [[t -» v], if, for all x £ T^Jt], 
Tr,v(f,x) = T TtV (g,x), then/ = g. 

5. T term [•] and T TiV respect the semantics of the simply-typed A- 
calculus. In particular: 

• If A derives r b x : r, then TJAJtj Sg r](x); 

• If A derives T b MN : v, Am derives V h ill : 
r —>■ v and An derives r b N : r, then TJAJtj = 
7V,„(7IAM]t7,7IA w ]7 ? );and, 

• If Aa derives Fh Ax : t.M : t —¥ v and Am derives T, x : 
t\~ M : v, then T TiV (TlA x \r),d) = TlA M }(r)[x h-7 d\). 

6 . Each set T\r\ is a PCPO with respect to C T , U T and _L r . 

The first five conditions are the standard requirements for type 
frames; the final condition relates the type frame and PCPO struc¬ 
tures of a PCPO frame. Given a PCPO frame T, we can define 
the interpretation of a polymorphic type scheme a as the mappings 
from the ground instances r of a to elements of T[r]. That is: 

r cheme H, = n(r e L< y J*)-'T type [T]]. 

where we will omit the scheme and 'P annotations when it is not 
ambiguous. For example, the identity function Ax.x has the type 
scheme Vf.f —» t. Therefore, the semantics of the identity function 
is a map from the ground instances of its type (i.e., the types 
t —> r) to the semantics of the simply-typed identity function at 


each type. We would expect its semantics to include the pair 
(Int Int,T term [l- Ax : Int.x : Int ->■ Int]) 
to account for the Int —» Int ground instance of its type scheme, 
the pair 

(Bool —> Bool, T term [[i- Ax : Bool.x : Bool —> Bool]) 
to account for the Bool —» Bool ground instance of its type scheme, 
and so forth. Note that if a has no quantifiers, and so |_crj ^ = {r} 
for some type r, then we have 

r cheme H„ = {{(r,fi)}|h£T tyP lr]}, 
and so an element of 7^ cheme |[r] is a singleton map, not an element 
of 7^ ype [t] . Harrison proves that T[cr] is itself a pointed CPO, 
justifying solving recursive equations in T[cr]. 

Theorem 1 (Harrison). Let T be a PCPO frame. Then, for any 
type scheme a, T\cr\ is a pointed CPO where: 

• For any f,g £ T\cr\, f g <=► (Vr £ \a\.f(r) C T 

g(r)); 

• The bottom element _I_ CT is defined to be {(r, _L r ) | r £ [o\ }; 
and, 

• The least upper bound of an ascending chain {fi} C T\cr\ is 
{(r,u T )\relo\, UT =U T (f i (r))}. 

We can define continuous functions and least fixed points for sets 
T\cr\ in the usual fashion: 

• A function/ : T[cr] —? T\er'\ is continuous if/(l_l CT X;) = 
U CT / (f(Xi)) for all directed chains A - ,- in T\cr\. 

• The fixed point of a continuous function / : T\cr\ —> T\c r] 
is defined by fix(/) = U CT (f"(_L CT )), and is the least value such 
that fix(f) =/(fix(/)). 

3.3 Semantics for Overloaded Expressions 

We can now give denotations for (typing derivations of) H“ ex¬ 
pressions. For some type environment T and substitution S £ 
GSubst(ftv(T)), we define an S — T-environment 77 as a mapping 
from variables to values such that rfx) £ T|[(Scr)] for each as¬ 
signment (x : cr) in F. Given a PCPO frame T, a derivation A of 
P | T Fa M : a, a ground substitution S, and an environment 77 , we 
define the interpretation T[A]Sr/ by cases. We have included only 
a few, representative cases here. 

• Case (—»E): we have a derivation of the form 


Ai =-’- A 2 = —'- 

P I T b A M : r -M-' P I F hA N : t 

A = —-- 

P I r b A (MN) : t' 

Let v = St and v' = St', and define 

7IA]S7? = Tv jV i ((tTfAi]S7?)(t7 v), 

(T|A 2 ]577)(W)}}. 

• Case (=>I): we have a derivation of the form 



P, 7T | T I-a M : p 


P I r b A M : 7T =► p 

This rule excludes those cases in which the predicate does not 
hold; thus, we define: 

riA].77 = (p A ^ if f^ ; 

10 otherwise. 
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• Case (=>E): we have a derivation of the form 


P |#Ju M : 7T => p P II- 7T 

A = —-- 

P \ T \~a M \ p 

This rule does not affect the semantics of expression M, and so 
we define: 

TJAjSr, = TlA 1 \Sri. 

• Case (VI): we have a derivation of the form 



^_ P\Vh A M:a t#MP, r) 

P \ T\~aM \ 'it.a 

Intuitively, we interpret a polymorphic expression as the map 
from ground instances of its type to its interpretations at those 
types. As the interpretation of the subderivation Ai is already in 
the form of a such a map, we can interpret A as the union of the 
meanings of Ai for each ground instantiation of the quantified 
variable t. Formally, we define 

rfAlSr, - y TlAtm ^ T])r,. 

• Case (VE): we have a derivation of the form 



P \ T \- A M \ it.a 


P\T\- A M : [r/f]<r 

By definition, [Vt.crJ = Ureor^ l>AH - and so Lb’AM fis 
\it.a\. Thus, the interpretation of A is a subset of the interpre¬ 
tation of Ai; writing f\y for the restriction of a function / to 
some subset Y of its domain, we define: 

TJAJSt? = (TJAijSij)! L [T/(]o-j • 

3.4 Expressions with Class Contexts 

To complete our semantics of H - programs, we must account for 
the meaning of class methods. Our approach is intuitively simple: 
we collect the meanings of the class methods from the method 
implementations in each instance, and use the meanings of the 
methods to define the meaning of the main expression. Formally, 
we extend the interpretation function from derivations of P V h, 4 
M : a to derivations of P | T b^ M : a as follows: 

• Let A be a derivation of P | T b* M : t. Then we know that A 
must begin with an application of (Ctxt) (Figure 4) with one 
subderivation 


* P\r,WTa Xi \-Alm{y,d):a y , i 
for each pair (y, d) € dom (Im) and a subderivation 


for the rr 
program 


Am 


P I Ty x, : CT Xi \ A M : T 


expression M. We enumerate the methods in the 


m *TIM X TKI X ... X TKJ. 


For each method xu we define a function/ : E 4 TJo\ t ,], 
approximating its meaning, as follows: 

fi((b i,b 2 , b m ))St] = U TlA Xi ,4S( V [x] 4 bj]), 

<Xi,d>edom(/ m ) 

and define function/ : E —>■ E, approximating the meaning of 
all the methods in the program, as 

m = (f 1 (b),Mb),...,f m (b)). 

We can now define a tuple b, such that the component h, is the 
meaning of method x,-, as follows: 

b= □ s /"(l s ). 

Finally, we extend the interpretation function to programs by 
TA[S V = TlA M jS( V [xi ->%?,). 

3.5 Polymorphic Identity Functions Revisited 
We return to our earlier example of polymorphic identity func¬ 
tions (§2.3). As before, we consider two definitions of identity 
functions, one given parametrically (idl) and one given by over¬ 
loading (id2). In this section, we will show that the denotations of 
idl and id2 agree at all types for which id2 is defined. By doing 
so, we provide an intuitive demonstration that our denotational se¬ 
mantics captures the meaning of ad-hoc polymorphic and agrees 
with our definition of equality for H - terms. 

We show that TJidl] and T[id2] have the same value at each 
point in the domain of T[id2]; that is, that for any type r £ GType 
such that lb Id2 r, 

TJidl](r —>• t) = T[id2](r r). 

We proceed by induction on the structure of r. In the base case, 
we know that t = K for some non-functional type K. As we have 
assumed lb Id2 r, we must have that K = Int, and, from the 
instances for Id2, we have 

T[id2](T K) = T[id2](lnt Int) 

Ax : Int.x : Int *4. Int]. 

As T|[id1](Int -¥ Int) = T|l- Ax : Int.x : Int -4 Int], we 
have TJidl ](T —» K) = T|[id2](A' —» K). In the inductive case, 
we know that t = to —t ti for some types to and n. From the 
assumption that (tr Id2 (to —¥ ti) and the instances for Id2, we 
can assume that Id2 to, Id2 ti, and that 

T|id2](T -»• t) = T[b A/ : (to -4 n) MofoH t -4 t] 
for some simply typed expressions M and N such that T|M] = 
T|id2](To -4 to) and TJ1V] = TJid2](Ti -4 n). The induction 
hypothesis gives that TJid2] (to -4 to) = TJidl ](t 0 -4 to) 
and that TJid2](Ti -4 ti) = TJid1](Ti -4 n), and thus that 
TJM] = TJb Ax : 7-1 .x : Ti -4 n] and TJA] = TJb Ax : t 0 .x : 
to -4 To]. By congruence, we have 

TJid2](T 4r)oTJA/ : (To 4- ti).(Ax : ti.x)o/o(Ax : t 0 .x)]. 
Finally, assuming a standard definition of composition, and reduc¬ 
ing, we have 

TJid2](T 4 t) = TJA/ : (TO 4 Ti)./] 

= TJA/ : T./] 

= TJidl](T4T). 

In our previous discussion of this example, we argued that if 
the set of types were restricted to those types for which Id2 held, 
then idl and id2 were equal. We can show a similar result here, 
by showing that if we define that t ::= Int | T 4 t. then 
TJidl ] = TJid2]. We begin by showing that they are defined over 
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the same domain; that is, that |Vf. t —>• t\ — {V«.Id2 u => u —> u\. 
By definition, we have 

{Vf. t —¥ t\ = {r -t r | t € GType} 


{Vm. Id2 u =>• u -A u\ = {r-A r.f f £ GType, lb Id2 r}. 

We show that lb Id2 r for all types r by induction on the structure 
of r. In the base case, we know that r = Int, and by the first 
instance of Id2 we have lb Td2 r. In the inductive case, we know 
that t = To —> ti for some types ro,n. In this case, we have 
that [ro/f, Ti/u]r = t —» u and by the induction hypothesis, that 
;,|\f d2 ro and lb Id2 t\. Thus, from the second instance of Id2, we 
can conclude that lb Id2 (r D -A ti), that is, that lb Id2 r. Because 
lb Id2 t for all ground types r, we have 

{t -> r | r £ GType, lb Id2 r} = {r —> t | r € GType}, 
and so TJidl ] and T[id2] are defined over the same domain. We 
have already shown that TJidl] and T[id2] agree at all points at 
which they are defined, and so we conclude T[id1] = T|[id2]. 

4. Formal Properties 

The previous sections have outlined typing and equality judgments 
for H - terms, and proposed a denotational semantics for H“ typ¬ 
ings. In this section, we will relate these two views of the language. 
We begin by showing that the denotation of a typing judgment falls 
into the expected type. This is mostly unsurprising; the only un¬ 
usual aspect of H“ in this respect is the role of the class context. We 
go on to show that the equational judgments are sound; again, the 
unusual aspect is to do with polymorphism ({VI} and {VE}) and 
class methods ({Method}). The H - type system follows Jones’s 
original formulation of OML; we rely on several of his metatheo- 
retical results, such as the closure of typing under substitution. 
Theorem 2 (Soundness of typing). Given a class context T, if A 
is a derivation ofP | T Kj, M : a, S is a substitution, and tj is an 
(ST)-environment, then T|[A]Sry £ T\(SP | So-)]*. 

We will divide the proof into three pieces. First, we show the 
soundness of the judgment P \ T \~a M \ a. Then, we will argue 
that the union of the implementations of a method has the type of 
the method itself. Finally, we can combine these results to argue the 
soundness of P | T b* M : a. 

Lemma 3. Given a class context 'T = (A, Si, Ini) where A is 
non-overlapping, if A is a derivation of P \ T \~a M : o, S 
is a substitution, and r) is a ( ST)-environment, then TJAJStj £ 

n(sp|s<7)]*. 

Proof. The proof is by induction over the structure of derivation 
A. The cases are straightforward; we include several representa¬ 
tive examples. (Meta-variables A„ are as in the definition of T[-] 
above.) 

• Case (=* I). Observe that |_ (S{P,it) \ S p )J = \_(SP \ 5(tt «#, 
p))J. As such, if 

n£xpr}Zrl{S{P,n)\Sp)h, 
then we must also have that 

7IA]Sr ? €7I(S J P|S( 7 r^p))]*. 

• Case (=> E). As entailment is (trivially) closed under substitu¬ 
tion, P lb 7T implies that SP lb Sn for any substitution S; thus, 
we can conclude that {(SP | S(tt => p))J = {(SP | Sp)J. 
Finally, assuming that TJAiJSr? € T[(SP | S (n => p))J, we 
can conclude that T[A] Sp € T[(S P | S p)]. 


• Case (VI). Because a = Mt.a' , we have that 

L*J= u LW*yj, 

and thus that 

TH= u Cn^AKl). 

Thus, assuming that for ground types t, T[Ai](S[f r])r/ 6 
T[(SP | S</)]. we have 

TlAISt/ e ( (J n(SP\Sa')})^m(SP\S0)l 

\reGType J 

• Case (VE). Assuming that T[Ai]S»? k T[(SP | S(Vf.a'))], 

the same argument about ground types as in the previous case 
gives that TlAJSri € T[(SP | Str)]. □ 

The interpretation of typings P | T b* M : a depends on 
the interpretations of the class methods. We will begin by showing 
that the interpretation of each method is in the denotation of its 
type. To do so, we will demonstrate that the interpretation of the 
type scheme of a method is the union of the interpretation of the 
type schemes of its instances. This will show that the union of 
the implementations is in the type of the method, from which the 
desired result follows immediately. 

Lemma 4. The ground instances of the type scheme of a method x 
are the union of its ground instances at each of its instances. That 

KJ= U K*J. 

(x,d)€dom(//n) 


Proof. Let o x = Vt.( it, Q) =4> r, where x is a method of class(n). 
We prove that 

W = U M 

(d,x)edom(Im) 

by the inclusions 

U l^j. 

( x,d)€dom(Im ) 

M ^ U 

(x,d) Gdom(/m) 

We will show only the first inclusion; the second is by an identical 
argument. Fix some v € {cr.vj. By definition, there is some S €. 
GSubst(t ) such that v = Sr and lb Sn,SQ. Because lb Sn, 
there must be some (d : Vm. P => tt') G A and substitution 
S' 6 GSubst(u) such that Sn = S' n and lb S' P. Now, we have 
that <j x ,d = Vt’ .(P, T Q) => Tt for some substitution T\ thus, there 
is some T‘ £ GSubst(t') such that v = T’ (Tt), SP = T' (: TQ ), 
and so v € \? x ,d\ ■ □ 

Lemma 5. The interpretation of the type scheme of a method x is 
the union of the interpretations of its type scheme at each instance. 
That is, 

Tb j = U 

(x,d)edo m (Im) 

Proof. Recall that 

T schem >,| = n(r £ {<7jeJ) .7 -type [[ t] . 
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From Lemma 4, we have that 


r chem >J=s:II re U K,/J ■7^ ype [r]. 

\ (x,a>6dom(/ m ) / 

As 7 _type [[-]] is a function, this is equivalent to 

r chem >J= y n(reM)^1r], 

<*,</> €dom(/m) 

and finally, again from the definition of 7 _scheme |.] j 

r chem >j= y r cheme Ka]. □ 

(x,d)edom(Im) 

Proof of Theorem 2. Finally, we can extend the soundness of our 
semantics to include class contexts. From Lemmas 4 and 5, we 
know that the interpretations of the methods fall in the interpreta¬ 
tions of their type schemes, and so if r? is a 5 — T-environment, then 
r)\xi 1 —t h] is a S — (r,*, : o Xi ) -environment. From Theorem 3, we 
have that TlA M jS{q[xi bt) € T{(SP | S<r)]*, and thus that 

TIAI6J? 6 7?(SP | Soft*. □ 

We would like to know that the meaning of an expression is 
independent of the particular choice of typing derivation. Unfortu¬ 
nately, this is not true in general for systems with type classes. A 
typical example involves the read and show methods, which have 
the following type signatures 

read :: Read t => String —»• t 
show :: Show t => t —» String 

We can construct an expression showo read of type 
(Read t, Show t) =► String —¥ String, 
where variable t can be instantiated arbitrarily in the typing, chang¬ 
ing the meaning of the expression. To avoid this problem, we adopt 
the notion of an unambiguous type scheme from Jones’s work on 
coherence for qualified types [3], 

Definition 6. A type scheme o = Vt.P => t is unambiguous if 

MP) C Mr )• 

As long as we restrict our attention to unambiguous type schemes, 
we have the expected coherence result. For example, suppose that 
A is a derivation of P r b A x.M : cr. We observe that A must 
conclude with an application of (—Fl), say at P 0 \ F \- A Xx.M : 
t —¥ t' , followed by a series of applications of (=>I), (=>E), (VI) 
and (VE). While these latter applications determine cr, we can see 
intuitively that each v € must be a substitution instance of 
r —¥ t', and that the interpretation of A at each ground type must 
be the interpretation of an instance of the subderivation ending with 
(—»I). We can formalize these two observations by the following 
lemma. 

Lemma 1. If a = Vt.Q =$ Sfc and Ai... A„ is a sequence of 
derivations such that: 

• Ai is a derivation of Pi \ r \~a M : n; 

• A n is a derivation ofP \ V\- A M ■. o; 

• Each of A-i ... A n is by (=>I), (=>E), (VI) or (VE); and, 

• Each Ai is the principal subderivation of A,-+i 
then 

(a) There is a substitution S such that t = Sti andPUQ HI- S P i ; 
and, 

(b) For all ground substitutions S, for all v € {ScrJ, there is a 
unique S' such that T\A n \Sqv = TlAijS'qv. 


The proof is by induction on n; the cases are all trivial. We can now 
characterize the relationship between different typings of M. 
Theorem 8 (Coherence of T[-J). If A derives P \ T Kt M : 
cr and A' derives P' \ T' \- A M \ o', where o and o' are 
unambiguous, then for all substitutions S and S' such that SP Hh 
S' P ', S r = S' V, and So = S'o', and for all ground substitutions 
U,TlAl(U°S)=TlA'l{U°S'). 

The proof is by induction over the structure of M. In each case, 
use of the inductive hypothesis is justified by Lemma 7(a), and the 
conclusion derived from the definition of T[-] and Lemma 7(h). 
As an immediate corollary, we have that if A and A' are two 
derivations of the same typing judgment, then T[A]| = T[A']. 
We can also show that, if P \ T La M : o is a principal typing of 
M, with derivation A, and A' derives P \ T La M : o' for any other 
o' , then for each substitution S' there is a unique S such that, for all 
environments q, TJAJSrj 3 T[A']S'? 7 . 

Theorem 9 (Soundness of =), Given a class context 'P, if o is 
unambiguous, P| V \-<s, M = N : o, and Am, An are derivations 
ofP I r h* M : o,P I r|: o, then T[[Am] = TJAjv], 

Proof. The proof is by induction over the derivation of P T Ki< 
M = N : o. The interesting cases are to do with polymorphism and 
overloading. 

• Case {=> I}. We have a derivation concluding 

P, 7T I JLI-m, M = N : p 
P I r h* M = N : it => p 

Let Am, A,v be typing derivations of P r La M : 7r => p 
and P | T l-A N : it => p; without loss of generality 
(because of Theorem 8), assume that each is by (=>I), with 
subderivations A' M , A' N of P, n \ T h* M : p and P, n \ 
r N : p. From the definition of T[[-], we have T[Am] = 
T[Am] and TJAat] = TJAjy]. The induction hypothesis gives 
that TIA'm\ = T[An], and so we can conclude T[A M ] = 
TIAat]. 

• Case {=> E}. We have a derivation concluding 

P | r h* M = N : tt => p PU- A ir 
P\T\- 9 M = N:p 

where 'P = ( A,Si,Im ). As in the previous case, the interpre¬ 
tation of the typing derivations for P \ T h® M : p and 
P | T K|/ A7 : 7T => p are equal, and similarly for the typing 
derivations for N, and thus the induction hypothesis is sufficient 
for the desired conclusion. 

• Case {VI}. We have a derivation concluding 


{(P I r h* M = N : [r/f]cr) | r £ GType} 

P | T hf M s JV : Vt.o 

From the induction hypothesis, we can conclude that, given 
derivations AJ, of P \ T h® M : [r/t]o and AJ, of P \ T 
N : [r/t\o, TIA t m\ = T|[A^]. Let A M derive P \ T h* M : 
Vt.o (and, without loss of generality, assume Am is by (VI)); 
we know that T[Am] = \J TeGType T\ Am]. We argue similarly 
for derivations An of P V Kj. N : Vt.o, and conclude that 

T[Am]=T[A*]. 

• Case {VE}. We have a derivation concluding 

P I r Kf M = N : Vt.o 
P | r h* M = N : [r/t\o 
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Let Am, A n be derivations that M and N have type [r/t]cr; with¬ 
out loss of generality, assume they are hy (VE), with subderiva¬ 
tions Am, A' n that M and N have type 'it.a. From the induction 
hypothesis, we know T[Am] = T[A# ], and from the defini¬ 
tion of T[-] we know that 7 ”[Am] C T[Am] and T|[Ajv] C 
TIA' n J. Thus, we can conclude that T[Am] = ffAjvf- 

• Case {Method}. We have a derivation of the form 

Si(x) = it,a d : P \\~a S n 
P\TP <AtSiM) x = Im(x,d):Sa 
Let Am be the derivation of P | T K|, x : Sa. From the defini¬ 
tion of T[-], we know that TlA M }Sr) = TlA' M }S(r)[xi i-»- bi]) 
where the Xi are the class methods, the b, are their imple¬ 
mentations, and Am is the derivation of P | L, Xi : cr, \~ A 
x : Sa. Since x is a class method, we know that r][xi b] 
maps x to some method implementation bj, and therefore that 
T[[Am] C bj. We also know that bj is the fixed point of a func- 
tion J5«fei,..., b n ))Sq = (J^THA^jS^jx; i-s- bi]), where 
A x ji derives P | T I~a I mix, d') : a xx i> and d is one of the d,. 
Thus, we know that if An derives P \ V Ki, Im(x, d) . Sa, then 
T[Ajv] C b } . Finally, as T[AmJ and T[A W ] are defined over 
the same domain, we have that T[Am] = T[ Ajv] . □ 

5. Improvement and Functional Dependencies 

In the introduction, we set out several ways in which extensions 
of type class systems went beyond the expressiveness of exist¬ 
ing semantic approaches to overloading. In this section, we re¬ 
turn to one of those examples, demonstrating the flexibility of our 
specialization-based approach to type-class semantics. 

Functional dependencies [5] are a widely-used extension of type 
classes which capture relationships among parameters in multi¬ 
parameter type classes. Earlier, we gave a class Elems to abstract 
over common operations on collections: 

class Elems c e | c —> e where 
empty :: c 

insert :: e —» c —» c 

The functional dependency c —> e indicates that the type of a 
collection (c) determines the type of its elements (e). Practically 
speaking, this has two consequences: 

• A program is only valid if the instances in the program respect 
the declared functional dependencies. For example, if a pro¬ 
gram already contained an instance which interpreted lists as 
collections: 

instance Elems [t] t where ... 

the programmer could not later add an instance that interpreted 
strings (lists of characters in Haskell) as collections of code¬ 
points (for simplicity represented as integers): 

instance Elems [Char] Int 

• Given two predicates Elems tv and Elems t'v', if we know 
t = r', then we must have v = v' for both predicates to be 
satisfiable. 

We now consider an extension of H“ to support functional de¬ 
pendencies. Following Jones [4], we introduce a syntactic char¬ 
acterization of improving substitutions, one way of describing 
predicate-induced type equivalence. We then extend the typing and 
equality judgments to take account of improving substitutions. Fi¬ 
nally, we show that the extended systems are sound with respect to 
our semantics. Importantly, we do not have to extend the models of 


terms, nor do we introduce coercions, or other intermediate trans¬ 
lations. We need only show that our characterization of improving 
substitutions is sound to show that the resulting type equivalences 
hold in the semantics. 

5.1 Extending H with Functional Dependencies 

To account for the satisfiability of predicates in qualified types, 
Jones introduces the notion of an improving substitution S for a set 
of predicates P [4], Intuitively, a S improves P if every satisfiable 
ground instance of P is also a ground instance of S P. Jones uses 
improving substitutions to refine the results of type inference while 
still inferring principal types. We will adopt a similar approach, but 
in typing instead of type inference. 

Syntax. We begin by extending the syntax of class axioms to 
include functional dependency assertions: 

Index sets X, Y C N 
Class axioms a ::= C : X Y \ d : VI. P sf; it 
In the representation of functional dependency axioms, we treat the 
class parameters by index rather than by name. If A were the axioms 
for the example above, we would expect to have a dependency 
Elems : {0} {1} € A. 

Any particular class name may appear in many functional depen¬ 
dency assertions, or in none at all. We adopt some notational ab¬ 
breviations: if A is an index set, we write it =x tv' to indicate that 
7r and it' agree at least on those parameters with indices in X, and 
similarly write it ~x it' to indicate that S is a unifier for those 
parameters of 7r and it' with indices in X. 

Improvement. To account for improvement in typing, we need a 
syntactic characterization of improving substitutions. In the case of 
functional dependencies, this can be given quite directly. We can 
give an improvement rule as a direct translation of the intuitive 
description above: 


(Fundep) 


PlbCr P\\-Cv 
(C:X~^Y)£A t —x v 
A b S improves P 


For example, if we have some Q such that Q IF Elems t v and Q lb 
Elems r v ', then (Fundep) says that the any unifying substitution 
U such that U v. apE U v' is an improving substitution for Q. If S 
is an improving substitution for P, then the qualified type schemes 
(P | cr) and (SP \ Sa) are equivalent, and we should be able 
to replace one with the other at will in typing derivations. One 
direction is already possible: if a term has type a, then it is always 
possible to use it with type S a (by a suitable series of applications 
of (VI) and (VE)). On the other hand, there is not (in general) a 
way with our existing typing rules to use a term of type S a as a 
term of type a. We add a typing rule to support this case. 


(Impr) 


SP \ ST \~a M : Sa A b 5improvesP 
P \ r \~ A M : a 


As in the case of (=>I) and (=^E), (Impr) has no effect 
semantics of terms. Thus, if we have a derivation 


the 


Ai = 
A = 


SP|5TbA M:Scr AbS improves P 
P | T b A M : a 


we define that T[A]S'rj = T[Ai]S"?j, where S" o S = S' (the 
existence of such an S" is guaranteed by the soundness of (Fun¬ 
dep)). Finally, we add a rule to the equality judgment allowing us 
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to use improving substitutions in equality proofs. 


{Impr} 


SP | S F h M = N : So A h S improves P 
p | r P (A ,siM) M = N : a 


Validating Functional Dependency Axioms. We must augment 
the context rule to check that the axioms respect the declared de¬ 
pendencies. This can be accomplished by, first, refining the overlap 
check to assure that no axioms overlap on the determining param¬ 
eters of a functional dependencies, and second, requiring that, for 
each dependency C : X Y and each instance P => 7r of class 
C, any variables in the positions Y are determined by the functional 
dependencies of P. Our formalization of the latter notion follows 
Jones’s development [6], We define the closure of a set of variables 
J with respect to the functional dependencies F as the least set J] 1" 
such that 

•7C/+;and 

• If U V £ F and U C J+, then V C J+. 

We write ftvx{C r) to abbreviate Udefine the instantia¬ 
tion of a functional dependency assertion C : X Y at a predicate 
7r = C r, as the dependency ftvx(tt) ftvy{it), and write fd(A, P ) 
for the set of the instantiation of each functional dependency asser¬ 
tion in A at each predicate in P. We can now define the verification 
conditions for axioms and the new version of (Ctxt), as follows. 


{tt tt' I (d : P =>• 7r), (d' : P' => tt'), (classic) : X Y) £ A} 
b non-overlapping(A) 

{Mt ry) ( class(* ) : X ^ Y) £ A} 

b covering(A) 

b non-overlapping{A) b covering(A ) 

{(P I r,x, : <T Xi ht Im(y,d) : o y , d ) \ (y,d) £ dom(/w)} 
p | r ,^ v a m-o 

(Ctxt)- 

P | r b {AtSiM) M : <r 


5.2 Soundness 

The significant challenge in proving soundness of the extended 
rules is showing that when A b S improves P is derivable, S is 
an improving substitution for P. Once we have established that 
result, the remaining soundness results will be direct. We introduce 
notation for the satisfiable ground instances of predicates P: 

|PJ A = { SP | S € GSubst(ftv(P)),\\- A SP}. 

We can now formally describe an improving substitution. 

Lemma 10. Given a set of axioms A such that b non-overlapping(A) 
and\~ covering(A), if A b S improves P, then \_P\a = S PJ a . 

Proof. By contradiction. Assume that A b S improves P; then we 
must have 7To, 7 Ti such that I Ha 7 ro,ihi it\ and there is a functional 
dependency (class( tt) : X Y) £ A such that 7ro =x 7Ti 
but 7ro t^y 7 Ti. We proceed by induction on the heights of the 
derivations of \\~a 7ro, lbj> 7ri. 

• There are distinct axioms d : P => 7To, d' : P' => 7rJ £ A and 
substitutions 50,51 such that 5 q tt' q = -tro and 5i 7ri = 7 t[. 
But then 5o o Si is a unifier for tt' 0 7r', contradicting 
b non-overlapping(A). 

• There is a single axiom d : P => 7 Tq and substitutions So, Si 
such that So 7 Tq = 7 To and Si tt' 0 = tti. We identify two sub- 


■ There is some type variable in ftVY{tt o) \ftvx{no) that is not 
constrained by P. This contradicts b covering(A). 

■ There is some 7 t £ P such that So 7r' and Si 7r' violate 
a functional dependency of class(TT r ). The derivations of 
1b So 7r' and lb Si n 1 must be shorter than the derivations of 
lb 7To, lb tt 1, and so we have the desired result by induction. 

□ 

Theorem 11 (Soundness of typing). Given a class context 'b, if A 
is a derivation of P \ T Kj, Af : o, S is a substitution, and r/ is an 
(, ST)-environment, then T[A]S»j € T[(SP | Scr)]*. 

Proof. We need only consider the (Impr) case. From Lemma 10, 
we have that if T improves P, then T\{P \ cr)]^ = Tf(T P \ 
T cr)]$, and so the result follows from the induction hypothesis. 

□ 

We extend our notion of ambiguity to take account of functional 
dependencies: it is enough for the variables in the predicates P to 
be determined by the variables of r. 

Definition 12. A type scheme o = Vt.P =4> r is unambiguous 
(given class axioms A) if/tv(P) C^v(r)^ (AjP) . 

The previous definition of ambiguity is a special case of this defi¬ 
nition, where fd (A, P) is always empty. As uses of (Impr) do not 
affect the semantics of terms, its introduction does not compromise 
coherence. 

Theorem 13. Ifc r is unambiguous and Ai, A 2 are derivations of 
P I r b* M : <7, then T[Ai] = T\ A 2 ]. 

Theorem 14 (Soundness of =). Given a class context 'b, if o is 
unambiguous, P \ F b^ M = N : o, and Am, An are derivations 
ofP | r b* M : o, P \ T b® N : cr, then T|[Am]m7 = TIAjv]^. 

Proof. Again, we need consider only the {Impr} case. Without 
loss of generality, assume Am and An are by (Impr), with sub¬ 
derivations A' m and A' n . As the interpretations of Am and A,v are 
equal to the interpretations of Am and A' v , the result follows from 
the induction hypothesis. □ 

6. Related Work 

The semantics of polymorphism, in its various forms, has been 
studied extensively over the past half century; however, the par¬ 
ticular extensions of Haskell that motivated this work are recent, 
and have received little formal attention. 

Our approach was inspired by Ohori’s semantics of Core 
ML [12]. While Ohori’s approach describes the semantics of poly¬ 
morphism, he does not represent polymorphic values directly, 
which leads to an unusual treatment of the typing of let expres¬ 
sions. Harrison extends Ohori’s approach to treat polymorphic re¬ 
cursion [1]; in doing so, he provides a representation of polymor¬ 
phic values. Harrison suggests that his approach could be applied 
to type classes as well. 

Ohori’s approach to the semantics of ML is somewhat unusual; 
more typical approaches include those of Milner [7] and Mitchell 
and Harper [9]. Ohori identifies reasons to prefer his approach 
over either that of Milner or that of Mitchell and Harper: both ap¬ 
proaches use a semantic domain with far more values than corre¬ 
spond to values of ML, either because (in the untyped case) those 
values would not be well-typed, or (in the explicit typed case) they 
differ only in the type-level operations. 

The semantics of type-class-based overloading has also received 
significant attention. Wadler and Blott [16] described the mean¬ 
ing of type classes using a dictionary-passing translation, in which 
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overloaded expressions are parameterized by type-specific imple¬ 
mentations of class methods. Applying their approach to the full 
Haskell language, however, requires a target language with more 
complex types than their source language. For example, in trans¬ 
lating the Monad class from the Haskell prelude, the dictionary 
for Monad r must contain polymorphic values for the return and 
(»=) methods. 

In his system of qualified types [2], Jones generalized the treat¬ 
ment of evidence hy translating from a language with overloading 
(OML) to a language with explicit evidence abstraction and appli¬ 
cation. Jones does not provide a semantics of the language with 
explicit evidence abstraction and application; indeed, such a se¬ 
mantics could not usefully be defined without choosing a particular 
form of predicate, and thus a particular form of evidence. 

Odersky, Wadler and Wehr [11] propose an alternative formu¬ 
lation of overloading, including a type system and type inference 
algorithm, and a ideal-based semantics of qualified types. How¬ 
ever, their approach requires a substantial restriction to the types of 
overloaded values which rules out many functions in the Haskell 
prelude as well as the examples from our previous work [10]. 

Jones [5] introduced functional dependencies in type classes, 
and discusses their use to improve type inference; his presentation 
of improvement is similar to ours, but he does not augment typ¬ 
ing as does our (Impr) rule. Sulzmann et al. [15] give an alterna¬ 
tive approach to the interaction of functional dependencies and type 
inference, via a translation into constraint-handling rules; unfortu¬ 
nately, their presentation conflates properties of their translation, 
such as termination, with properties of the relations themselves. 
System Fc [14] extends System F with type-level equality con¬ 
straints and corresponding coercion terms. While we are not aware 
of any formal presentation of functional dependencies in terms of 
System Fc, we believe that a formulation of our (Fundep) rule 
in terms of equality constraints is possible. In contrast to our ap¬ 
proach, System Fc requires extending the domain of the semantics 
while still requiring translation of source-level features (functional 
dependencies or GADTs) into features of the semantics (equality 
constraints). 

7. Conclusion 

We have proposed an alternative approach to the semantics of 
overloading, based on interpreting polymorphic values as sets of 
their monomorphic interpretations, which avoids several problems 
with traditional translation-based approaches. We have applied this 
result to a simple overloaded calculus, and shown the soundness 
of its typing and equality judgments. Finally, we have argued that 
the approach is flexible enough to support extensions to the type 
system, such as allowing the use of improving substitutions in 
typing. We conclude by identifying directions for future work: 

• Practical class systems are richer than the one used in this 
paper. We would like to extend these results to fuller systems, 
including our prior work on instance chains. 

• Dictionary-passing provides both a semantics of overloading 
and an implementation technique. We would like to explore 
whether implementation techniques based on specialization can 
be used to compile practical languages. 

• We claim that our approach avoids making distinctions between 
some observationally equivalent terms (such as in the poly¬ 
morphic identity function example). We would like to explore 
whether adequacy and full abstraction results for the underlying 
frame model can be extended to similar results for our seman- 

• Our definition of equality provides ^-equivalence; however, t] 
equivalence is not sound for Haskell. We would like to explore 


either whether our approach can be adapted to a language with¬ 
out ^-equivalence. 
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